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. Abstract 

^ ■ The problem of power allocation is studied for a multiple-input multiple-output (MIMO) decode-and-forward 

(DF) two-way relaying system consisting of two source nodes and one relay. It is shown that achieving maximum 
sum-rate in such a system does not necessarily demand the consumption of all available power at the relay. Instead, the 
maximum sum-rate can be achieved through efficient power allocation with minimum power consumption. Deriving 
such power allocation, however, is nontrivial due to the fact that it generally leads to a nonconvex problem. In Part I of 
C/3 ' this two-part paper, a sum-rate maximizing power allocation with minimum power consumption is found for MIMO 

o ■ 

DF two-way relaying, in which the relay optimizes its own power allocation strategy given the power allocation 
strategies of the source nodes. An algorithm is proposed for efficiently finding the optimal power allocation of the 
^ , relay based on the proposed idea of relative water-levels. The considered scenario features low complexity due to the 

fact that the relay optimizes its power allocation without coordinating the source nodes. As a trade-off for the low 
\ complexity, it is shown that there can be waste of power at the source nodes because of no coordination between the 

relay and the source nodes. Simulation results demonstrate the perfomance of the proposed algorithm and the effect 
of asymmetry on the considered system. 
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I. Introduction 



I Two-way relaying (TWR) has recently attracted significant interests |[T]- ifTTl . Establishing bi-directional links 

. between one relay and two source nodes, the information exchange between the source nodes can be accomplished 

in two time slots HI. In the first time slot (first phase) the source nodes simultaneously transmit their messages to 
the relay while in the second time slot (second phase) the relay forwards the messages to the destinations. The first 
phase is called the multiple access (MA) phase while the second phase is the broadcasting (BC) phase of TWR. 
Compared to conventional one-way relaying, which needs four time slots for the information exchange between the 
source nodes, TWR can achieve a higher spectral efficiency |ID. 
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As the performance of TWR depends on the transmit strategies of both the source nodes and the relay, optimizing 
the transmit strategies such as power allocation and beamforming is one of the main research interests in TWR. 
The transmit strategies of the relay and source nodes depend on the relaying scheme. Similar to one-way relaying, 
the relaying scheme in TWR can be amplify-and-forward (AF), decode-and-forward (DF), etc., depending on the 
manner that the received information is processed at the relay before it is forwarded to the destinations. In the AF 
TWR scheme, the relay amplifies and broadcasts the signals received from the source nodes while it also amplifies 
and forwards the noise at the relay. Sum-rate maximization for multiple-input multiple-output (MIMO) AF TWR in 
which the relay and the source nodes all occupy multiple antennas is investigated in H, while a mean square 
error minimizing scheme for MIMO AF TWR is studied in ||5]. For MIMO AF relaying, low-complexity sub-optimal 
solutions can be obtained through diagonalizing the MIMO channel based on the singular value decomposition (S VD) 
or the generalized SVD (GSVD) and thereby transferring the problem of beamforming/precoding to the problem of 
power allocation O, |l5|. Finding the optimal solution, on the other hand, usually requires iterative algorithms with 
high complexity iHJ, ||5|. The main challenge in investigating AF TWR, especially AF MIMO TWR, is the strong 
coupling between the transmit strategies of the source nodes and the relay due to noise propagation. As the result 
of noise propagation, the optimization over the transmit strategies of the source nodes and the relay usually leads 
to nonconvex problems. For example, the information rate of the communication in either direction is a nonconvex 
function of the covariance/beamforming matrices of the sources and the relay [1]. 

Unlike AF relaying, DF relaying does not have the problem of noise propagation. As a result, DF TWR may 
achieve a better performance than AF TWR, especially at low signal-to-noise ratios (SNRs), at the cost of higher 
complexity. Moreover, optimizing the power allocation in DF relaying usually leads to convex problems (see for 
example (|6] and Q). DF TWR has been studied in JSl- ifTSll . The optimal power allocation for DF TWR is studied 
under a fairness constraint in ifTZI . The optimal time division between the MA and BC phases and the optimal 
distribution of the relay's power for achieving weighted sum-rate maximization are studied in lfT3l . While the above 
two works assume a single antenna at both the sources and the relay, the case with multiple antennas at all nodes 
is investigated in lfT4l - ifTSll . The achievable rate region and the optimal transmit strategies of both the source nodes 
and the relay are studied in [T4], where the relay's optimal transmit strategy is found by two water-filling based 
solutions coupled by the relay's power limit. The authors of lITSl specifically investigate the optimal transmit strategy 
in the BC phase of the MIMO DF TWR. It is shown that there may exist different strategies that lead to the same 
point in the rate region. Given that TWR can achieve a high spectral efficiency, it is of interest to optimize the 
power allocation so that the TWR scheme achieves high spectral efficiency using minimum power consumption. 
Unlike AF TWR, in which the sum-rate can always be increased when the relay has more transmission power, the 
maximum sum-rate of DF TWR can be achieved without consuming all the available power at the relay. However, 
finding the sum-rate maximizing power allocation with minimum power consumption is no longer a convex problem 
in general. Part I of this two-part paper studies the problem of finding the optimal relay power allocation which 
minimizes the relay power consumption among all relay power allocations that achieve the maximum sum-rate 
for the MIMO DF TWR given the power allocation of the source nodes. For brevity, this problem is called the 
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sum-rate maximization with minimum (relay) power consumption. The considered scenario is referred to as relay 
optimization scenario. The objective of Part I of this two-part paper is to find the optimal power allocation strategy 
of the relay in the relay optimization scenario^ The contributions of this part are as follows. 

First, we show that the considered problem of sum-rate maximization with minimum relay power consumption is 
nonconvex. As the minimization of the relay power consumption is considered, the problem becomes more complex 
and the method used for deriving the optimal relay power allocation strategy in |7| and |14| is no longer valid. 
We first prove the sufficient and necessary condition for a relay power allocation to be optimal in the considered 
relay optimization scenario. Then, based on this condition, we propose an efficient algorithm for finding the optimal 
solution. The proposed algorithm can obtain the optimal relay power allocation in several steps without iterations, 
i.e., low complexity is achieved. 

Second, we show that while the relay optimization scenario has the advantage of low complexity, as a trade-off 
it may lead to a waste of power at the source nodes because of the lack of coordination between the source nodes 
and the relay. We analyze the solution of the relay optimization problem for different relay power limits and show 
that a waste of power at the source nodes happens when the relay has a power limit less than a certain threshold 
for each considered system configuration and the thresholds are also given. 

Third, the effect of asymmetry on the considered MIMO DF TWR is analyzed and demonstrated. It has been 
observed in lfT6l . iflTl that the asymmetry on channel gain, relay's location, etc., can cause a performance degradation 
in single-input single-output (SISO) TWR. We extend this to the MIMO case and show the effect of asymmetry in 
power limits and number of antennas at the source nodes through analysis and simulations. 

The rest of the paper is organized as follows. Section |II]gives the system model of this work. The relay optimization 
problem is solved and the features of the solution are investigated in Section Hill Simulation results are shown in 
Section JV] and Section |V] concludes the paper Section [Vl] "Appendix" provides proofs for some lemmas and all 
theorems. 

II. System Model 

Consider a TWR with two source nodes and one relay, where source node i {i = 1,2) and the relay have 
and Tir antennas, respectively. In the MA phase, source node i transmits signal WiS,; to the relay. Here is the 
precoding matrix of source node i and is the complex Gaussian information symbol vector of source node i. The 
elements of Si,Vi are independent and identically distributed with zero mean and unit variance. The channels from 
source node i to the relay and from the relay to source node i are denoted as Hjr and H^j, respectively. Receiver 
channel state information is assumed at both the relay and the source nodes, i.e., source node i knows H^i and the 
relay knows Hii-,Vi. It is also assumed that the relay knows Hri,Vi by using either channel reciprocity or channel 
feedback. The received signal at the relay in the MA phase is 

y, = HirWiSi + H2rW2S2 + n, (1) 

'Some preliminary results were presented at a conference II18I . 
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where rii is the noise at the relay with covariance matrix a^I in which I denotes the identity matrix. The maximum 
transmission power of source node i is Hmited to p™'^^. Define the transmit covariance matrices = W^Wf , Vi, 
in which (•)^ stands for the conjugate transpose, and let D = [Di,D2]. Then the sum-rate of the MA phase is 
bounded by lfT9ll 

I+(HirDiHH+H2rD2H«)(fT?)"' 



R'^^iB) = log 



(2) 



where | • | denotes determinant. In the BC phase, the relay decodes Si and S2 from the received signal, re-encodes 
messages using superposition coding and transmits the signal 

Xr = Tr2Sl + T,.lS2 (3) 

where Ti; is the x rij relay precoding matrix for relaying the signal from source node j to source node 
The maximum transmission power of the relay is limited to P™^'^. Note that in addition to the above superposition 
coding, the Exclusive-OR (XOR) based network coding is also used at the relay in the literature ll20l - ll22l . While 
XOR based network coding may achieve a better performance than superposition coding, it relies on the symmetry 
of the traffic from the two source nodes. The asymmetry in the traffic in the two directions can lead to a significant 
degradation in the performance of XOR in TWR 11211 . Il22l . As the general case of TWR is considered and there is 
no guarantee of traffic symmetry, the approach of symbol-level superposition is assumed here at the relay as it is 
considered in |[T] and |[T3l . Moreover, for the MIMO case as considered in this work, the superposition scheme can 
take advantage of the MIMO channels. In the superposition scheme, the relay uses separate beamformers for the 
signals towards two directions, which guarantees that each transmitted signal is optimal (subject to the transmission 
power constraints) given its MIMO channel. This cannot be achieved if the relay uses XOR based network coding. 

The received signal at source node i is 

= H,.,xi. + n, (4) 

where is the noise at source node i with covariance matrix afl. With the knowledge of Hi-, and Ti^, source 
node i subtracts the self-interference H^iTj-jSi from the received signal and the equivalent received signal at source 
node i is 

y., = H„T„Sj + rii. (5) 

Define = TriT]^,Vi and let B = [Bi,B2]. The sum-rate of the considered DF TWR can be written as |[T], 
Ca, ||20l 

i?'*(B, D) = i min{i?'"^(D), i?(B, D)} (6) 

where 

i?(B,D) = min{4i(Bi),i?2r(D2)} 

+ min{4.2(B2),i?i,(Di)}, (7) 

^It is assumed as default thi'oughout the paper that the user index i and j satisfy i ^ j. 
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in which 

%.(D,) = log |I + (H,,D,HH)(a2)-i|, (8) 

and 

R„{B,) = log|I+ (H„B,H«)((7f)"i|. (9) 
For brevity of presentation, we define the following sum-rate of the BC phase 

i?'^^(B) = 4i(Bi)+42(B2) (10) 

to represent the summation in the above equation hereafter 

For the relay optimization scenario considered here, the relay maximizes the sum-rate in ^ using minimum 
transmission power given the power allocation strategies of the source nodesj^ Since the relay needs to know Wi 
and W2 for decoding Si and S2, respectively, as well as for designing Tii and Tr2, the source nodes should send 
their respective precoding matrices to the relay after they decide their transmit strategies. Similarly, the relay should 
also send Tji and T12 to both source nodes. 

Given the above system model, we next solve the relay optimization problem. 

III. Relay optimization 

In the relay optimization scenario, the relay and the source nodes do not coordinate in choosing their respec- 
tive power allocation strategies. Instead, the relay aims at maximizing _R*^(B,D) in ^ with minimum power 
consumption after the source nodes decide their strategies and inform the relay. 

Denote the power allocation that the source nodes decide to use as D° = [D^, D^jQpor maximizing the sum-rate 
given D°, the relay solves the following optimization problem^ 

max i?*^(B,D°) (11a) 

s.t. Tr{Bi + B2} < P;;"^"". (lib) 

The problem (fTTT i is convex. However, in order to find the optimal B with minimum Tr{Bi+B2} among all possible 
B's that achieve the same maximum of the objective function in (fTTT i. extra constraints need to be considered. Two 

'The term 'sum-rate' by default means i?*"(B, D) when we do not specify it to be the sum-rate of the BC or MA phase. 

'^The source nodes may determine their power allocation strategies using different objectives. Note that different source node power allocation 
strategies lead to different solutions of the relay optimization problem. However, the approach adopted for solving the relay optimization problem 
is valid for arbitrary source node power allocation. 

^The positive semi-definite constraints D,; 0,Vi and B; >; 0,Vj are assumed as default and omitted for brevity in all formations of 
optimization problems in this paper. 
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necessary constraintj^ are 

i?„(B,) <i?,r(D°),Vz (12a) 
i?(B,D°) < ff"'^(D"). (12b) 

The constraint ( I12ab is necessary because, due to the expression of i?(B,D) in (|7]i, the power consumption 
of the relay can be reduced without decreasing the sum-rate i?'"(B,D) in (|6]l given D*^ by reducing Tr{Bi} if 
Rri(Bi) > RjiCD^). Note that ( I12ab is not necessarily satisfied with equality at optimality. In fact, it can be shown 
that ( I12ab should be satisfied with inequality for at least one i at optimality using subsequent results in Section lTll-BI 
It can also be shown that ( I12al ) can be satisfied with inequalities for both i's at optimality even if the relay has an 
unUmited power budget. We stress that (112a) is not sufficient for obtaining the optimal solution. Other constraints 
are also needed including ( I12bb . The constraint (I12bb is also necessary because if it is not satisfied given D", then 
the power consumption of the relay can be reduced without decreasing the sum-rate i?*^(B,D°) by decreasing 
i?(B,DO) so that R{B,T)°) = i?'"'^(DO). 

The constraints in (fT2T l make the considered problem nonconvex. The objective in this section is to find an efficient 
method of deriving the optimal power allocation of the relay in the considered scenario of relay optimization. It 
is straightforward to see that the power allocation of the relay should be based on waterfilling for relaying the 
signal in either direction regardless of how the relay distributes its power between relaying the signals in the two 
directions. This is due to the fact that the BC phase is interference free since both source nodes are able to subtract 
their self-interference. If the objective were to maximize i?^'^(B) instead of i?'*(B,D"), the optimal strategy of 
the relay could be found via a simple search. Indeed, in that case, we could find the optimal power allocation 
of the relay and consequently the optimal B by searching for the optimal proportion that the relay distributes its 
power between relaying the signals in the two directions. However, such approach is infeasible for the considered 
problem. The reason is that first of all it is unknown what is the total power that the relay uses in the optimal 
solution. As power efficiency is also considered, the relay may not use full power in its optimal strategy. Moreover, 
from the expression of i?'"(B,D) in (|6]l, it can be seen that the maximum achievable i?'"'(B,D'') also depends 
on _Rir(D5), /i2i (D2)' ™d ff"''(D"). Due to this dependence, the two constraints in (fT2] | are necessary for the 
considered problem of sum-rate maximization with minimum power consumption. However, these two constraints 
are implicit in the sense that they are constraints on the rates instead of on the power allocation of the relay. Such 
constraints offer no insight in finding the optimal B. In order to transform the above mentioned dependence of 
i?*'^(B,D°) on ^ii.(D?), ^2r(D2), and i?™''(D°) into an explicit form, and to discover the insight behind the 
constraints in (fT2] i. we next propose the idea of relative water-levels and develop a method based on this idea. 

'These two necessary constraints are introduced here to show that the considered relay optimization problem is nonconvex. For the sufficient 
and necessary condition that a power allocation strategy is optimal in terms of maximizing sum-rate with minimum power consumption, please 
see Theorem 2. 
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A. Relative water-levels 

Denote the rank of Hr^ as r^j and the singular value decomposition (SVD) of H^i as UriflriV^- Assume that the 
first diagonal elements of fti-i are non-zero, sorted in descending order and denoted as a;ri(l), . . . , uj,-i{T,:i), while 
the last minimi, rii} — r^i diagonal elements are zeros. Define 2i — {1, . . . , iri}, Vi and ai{k) — \ujri{k)\'^ /af ,yk e 
Xi^i. For a given D — [Di,D2], define /zi(Di), ^2(02), and /^nia(D) such that 

log (1 + (^^"2(fc) - l)A = i?ir(Di) (13a) 

E log + - 1)^) = ^2r(D2) (13b) 

where (•)+ stands for projection to the positive orthant. The physical meaning of /^^(Di) is that if waterfilling 
is performed on LLi,:j{k)\, Vfc G X, using the water-level l//ii(Di), then the information rate of the transmission 
from the relay to source node j using the resulting waterfilling -based power allocation achieves precisely RiriTii). 
The physical meaning of /ima(D) is that if waterfilling is performed on tJr,;(fc)'s, Vfc e Ii,Vi using the water- 
level l//i,na(D), then the sum-rate of the transmission from the relay to the two source nodes using the resulting 
waterfilling-based power allocation achieves precisely i?'"''(D). Note that l//ii(Di),Vi and l//ini(D) are not the 
actual water-levels for the MA or the BC phase. They are just relative water-levels introduced to transfer and 
simplify the constraints in (fTZt . Denote the actual water-levels used by the relay for relaying the signal from source 
node j to source node i as l/Ai,Vi. With water-level 1/A,;, can be given as = VriPri(Ai)Vj^ where 
Pri(Ai) = diag^( J- — a\i) )^ ■< ■ • ■ ' (x"~ a (r ) )^' ^' • ■ • ' which diag(-) stands for making a diagonal matrix 
using the given elements, (■)+ stands for projection to the positive orthant, and 0„^_r^; stands for all-zero matrix of 
size (rir — Tri) X [n-c — Vri). The power allocated on LOn{k) is pri{k) = (l/A^ — 1/Q;i(fc))^, Vfc G li, Vt. The resulting 
rate R-di'^i) is given by ^ log(l + (Q;i(fc)/Ai — l)^). Using /xi(Di), /i2(D2), and /Xma(D), the constraints in 
(I12al l can be rewritten as 

A, >/^,(D°),Vi (14a) 
^^log(l + (la.(fc)-l) 

Given ( I13al ) and jlBbl l. it is not difficult to see that ( I12al ) is equivalent to (I14ab . Moreover, the equivalence between 
( fT2bl i and ( fT4bl i can be explained as follows. Given D° and ( fT2bl i. i?'*(B,D°) in ( fTTal i becomes i?(B,D°)/2. 
Given (I12al) . or equivalently (I14ab . i?(B, D) in Q with D = D° becomes ^i.i(Bi) + -Ri2(B2). Then, substituting 
the left-hand side of ( |12b| ) with ^11(61) + _Ri.2(B2), i.e., i?'^'^(B) in ([TOll, and using dBcK the constraint (|14bt is 
obtained. 

The procedure for the relay optimization can be summarized in the following three steps: 
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1. Obtain Mi(D?), ^^2(0^), and /imalD") from D^; 

2. Determine the optimal A^; 

3. Obtain P,i(Ai) and from A^. 

The first and the third steps are straightforward given the definitions ( I13al l- (ll3cl l and (??). Therefore, finding the 
optimal A,; , Vi in the second step is the essential part to be dealt with later in this section. 

From hereon, /ii(Di), /i2(D2), and /inia(D) are denoted as /ii, /i2 and /ima, respectively, for brevity. The same 
markers/superscripts on and/or D are used on /i^ and/or /ima to represent the connection. For example, /ii(D°) 
and /iina(D) are briefly denoted as /i° and /ima, respectively. The rate i?ii(Bi) obtained using water-level l/Xi is 
also denoted as R^i{Xi). 

B. Algorithm for relay optimization 

Using the relative water-levels /ii, Vi and /ima, we can now develop the algorithm for relay optimization. In order 
to do that, the following lemmas are presented. 
Lemma 1: l//ima < max{l//ii, l//i2}. 

Proof: The proof for Lemma 1 is straightforward. Using (I13a) - (ll3cl i. it can be seen that R™^{T>) > J2 ^ir(Di) if 

i _ 

l//^ma > max{l//ii, l//i2}. However, given the definitions in (|2]i and ([8]), it can be seen that i?™^(D) > ^i?ii(Di) 

i 

is impossible lfT9l . Therefore, l//ima < max{l//ii, l//i2}- B 
Lemma 2: Assume that there exist {Xi,Xj} and {X'^,X'j} such that X'^ < Xi < Xj < X'j. If Tr{Pii(A;)} = 

XTr{Pri(A;)}, then X4i(Ai) > E ^AK) as long as 1/A, > min{l/a,(fc)}. 

I I ; 

Proof: See Subsection IVI-AI in Appendix. ■ 

Lemma 2 states that, for any given {Ai,A2} such that I/A2 > min{l/Q:2(fc)} assuming Ai < A2, decreasing 

k 

min{Ai, A2} and increasing max{Ai, A2} while fixing the total power consumption leads to a smaller BC phase 
sum-rate than that achieved by using {Ai, A2}. 

Lemma 3: Assume that there exist {Ai, Xj} and {A^, A^} such that Ai < Xj, X[ > Ai and A^ > Xj, and 

i?„(A:) + 4-j(Aj) = i?„(A,) + 4j(A;) (15) 

then as long as A^ < Xj, it holds true that 

Tr{P„(A:)} + Tr{P,,(A,)} <Tr{P„(A.)} + Tr{P,,(A;)}. (16) 

Proof: See Subsection IVI-BI in Appendix. ■ 
Lemma 3 states that, for any given {Ai, A2}, decreasing min{Ai, A2} and increasing max{Ai, A2} such that the 

BC phase sum-rate is unchanged, the power consumption increases. 

Theorem 1: The optimal solution of the considered relay optimization problem always satisfies the following 

properties 

min I I = min I ^ I if Ai 7^ A2 (17a) 
lAi A2J iMi M2J 

= = min(^,4T !> if Ai = A2 (17b) 



Ai A2 Ka'AO 
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in which is the water-level obtained by waterfiUing p™^'*^ on uj,-i{k),yk G l^ijVi- 
Proof: See Subsection IVI-CI in Appendix. ■ 
According to the proof of Theorem 1, it can be seen that Ai 7^ A2 at optimality and consequently the equation 
in ( I17al ) holds when both of the following two conditions are satisfied: (i) the relay has sufficient power, i.e., 
1/A" > niin{l//i5, l/^ij}- and (ii) there is asymmetry between fif and fi2, i.e., min{l//i5, I/a*"} < l/^ma < 
max{l//i", 1/^2}- If either of the above two conditions is not satisfied, Ai = A2 at optimality and consequently 
the equation in jllhi holds. 

Theorem 2: The conditions ( |14a| ), ( |14b| i, ( |17a| i, and (|17b| i are sufficient and necessary to determine the optimal 
{Ai, A2} with minimum power consumption for the relay optimization problem among all {Ai, A2}'s that maximize 
the sum-rate i?**(B,DO). 

Proof: See Subsection IVI-DI in Appendix. ■ 
It should be noted that the power constraint (II Ibb is not always tight at optimality due to the constraints in (I14al l. 
( I14bl i (or equivalently ( I12ab . (I12bb ). ( II Tab , and (I17bb . Each of ( I14al ). ( I14bb . ( I17al ), and ( I17bl ) may refrain the relay 
from using its full power at optimality. The reason can be found from the proofs of Theorems 1 and 2. Specifically, 
( I14al ) and ( I17ab make sure that there is no superfluous power spent for relaying the signal in each direction while 
( I14bb and ( I17bb guarantee that the power consumption of the relay cannot be further reduced without reducing the 
sum-rate. 

Based on the above results in Theorem 1 and Theorem 2, the algorithm summarized in Table Uis proposed to find 
the optimal relay power allocation for the relay optimization problem. The algorithm can be briefly understood as 
follows. Step 1 performs initial power allocation and obtains the initial water level A*'. The water-levels A^ = A", Vi 
maximize i?'"^(B) among all possible {Ai, A2} combinations subject to the power Umit of the relay. Step 2 checks if 
min{i?ri(Bi), /Jjr(Dj)} is upper-bounded by /Jjr(Dj), Vi. If Ri-i{Xi) > -R2r(D2)' the relay reduces its transmission 
power allocated for relaying the signal from source node 2 to source node 1 so that _Rii(Ai) = /i'21 (D2) ^^^P ^■ 
In the case that _Rri(Ai) is reduced in Step 3, in terms of increasing Ai, extra power becomes available for relaying 
the signal from source node 1 to source node 2. Therefore, if _Rr2(A2) < Rir(Di), the remaining power of the 
relay is allocated for relaying the signal from source node 1 to source node 2 at first in Step 4. Later in Step 4, 
it is checked if Ri2{M) > ^ii(Di) under the new power allocation. If -Rr2(A2) > Rir(Di) in Step 4, the relay 
reduces its transmission power allocated for relaying the signal from source node 1 to source node 2 so that 
RMX2) = -Rir(D") in Step 5. Steps 6 checks if -Rri(Ai) + -Rr2(A2) < i?"'''(D"). In the case that this constraint 
is not satisfied. Step 6 or Step 7 revise the power allocation so that _Rri(Ai) + i?r2(A2) = i?'"^(D") and the power 
consumption of the relay is minimized. The above procedure in the proposed algorithm, which terminates after 
Step 6 or 7, is not iterative. 

The following theorem regarding the proposed algorithm is in order. 

Theorem 3: The water-levels obtained using the algorithm for relay optimization in Table |T] achieve the optimal 
relay power allocation for the considered relay optimization problem of sum-rate maximization with minimum relay 
power consumption. 
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TABLE I: The algorithm for relay optimization. 

1. Initial waterfilling: allocate pm^x uj^^[k),\fk e Xi,\/i 
using waterfilling. Denote the initial water level as 1/A". Set 
1/Ai = I/A2 = 1/A". The power allocated on u),-i{k) is 
p„{k) = (1/A, - l/a,(fc)) + ,Vfc gX„Vi. 

2. Check if l/A^ < I/m^ for both i = 1,2. If yes, proceed to 
Step 6. Otherwise, assume that 1/Ai > l/fJ-2, proceed to Step 3. 

3. Set Ai = Check if I/A2 < I/m?. If not, proceed to Step 4. 
Otherwise, proceed to Step 5. 

4. Calculate P/ = P™^"' - J2 Pri(fc). Allocate P/ on 
a;r2(fc)'s,VA: 6 I2 via waterfilling. Obtain the water level I/A2. 
If I/A2 > 1/Mi> proceed to Step 5. Otherwise, go to Step 6. 

5. Set A2 = fjfl and proceed to Step 6. 

6. If 1/A, > l/Mma:Vi, Set A; = Mma.Vi. Check if 1/A, < 
l/^5Jjjj,Vi. If yes, output Ai,Vi and break. Otherwise, check 
if E-^"{Ai) < ff^^^CD"). If yes, output A,,Vj and break. 

i 

Otherwise, proceed to Step 7. 

7. Assuming that Aj < A^, find A^ such that |AlJ.|logA^- = 

E logai(fc) - /{■"''(DO) + Hj,(DO), where = 

k€M + 

{1/xr - l/a,(fc)) + ,Vfc e Xj, M+ = {k\prj{k) > 0} and 
I is the cardinality of the set M^y Set Aj = A^ and output 
Ai and Aj. 



Proof: See Subsection IVI-EI in Appendix. ■ 
Depending on the source node power allocation strategies and the power limit at the relay, different results can be 
obtained at the output of the algorithm in Table|I] Define the power thresholds = S S (l/^ma^ 

i keli 

Pi = EJ: (l/iiiax{M;,/i^}-lM(fc))^,i^t=E E (l/M?-lM(fc))^andP,=E E (l/min{A*?,A*^}- 

i k^Xi i k^Xi i k^Xi 

\/ai{k))^ . Recall from Lemma 1 that /i^j^ > /i^}- 

For the case that /i^^^ ^ maxj/i]*, /ij}, the following subcases exit as p^^^ increases. If p^^^ is small such that 
pmax ^ P^^^ the algorithm proceeds through Steps 1-2-6 and 

A, = A">mS'„^,Vi (18a) 

^Tr{P„(A,)}=Pr"" (18b) 

i 

at the output of the algorithm, while ( |14at and ( |14bt are satisfied with inequality. Note that some power of the 
source nodes is wasted in this subcase. Since the sum-rate i?'"'(B,D) is bounded by i?ri(Ai) + i?r2(A2) due to the 
small power limit of the relay, the source nodes could use less power without reducing i?*^(B, D) if there would 
be coordination in the system. Indeed, if the source nodes could be coordinated to optimize their power allocation 
as well, they only need to use the power of TrlDljH-TrjDj}, where = [d|, D2] is the optimal solution to the 
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following problem 

min Tr{Di} + TrjDz} (19a) 

s.t. i?"^"(D) > 4i(A°) + 42(A°) (19b) 

^ir(Di) > A.2(A°) (19c) 

i?2r(D2) > i?rl(A°). (19d) 

It can be shown that Tr{D?} + Tr{D§} > Tr{D|} + Tr{D^} in this subcase. Therefore, the power of Tr{D5} + 
Tr{D^} - Tr{D|;} - Tr{T>l} is wasted at the source nodes because of the lack of coordination. 

Increasing p^^^^^ such that P„ia < P™^'^ < Pi, the algorithm proceeds through Steps 1-2-6. Increasing p™^'^ 
such that Pi < p^'"^'^ < Pj, the algorithm proceeds through Steps 1-2-3-4-6. Further increasing p™'^^ such that 
Pt < P™^'' < Pg, the algorithm proceeds through Steps 1-2-3-4-5-6. Further increasing pmax ^^^.j^ jj^^j pmax ^ p^^ 
the algorithm proceeds through Steps 1-2-3-5-6. In the above subcases, it holds that 

A. = Mma>A",Vi (20a) 



Tr{P.,(A,)} < Pr"" (20b) 



E 

i 

at the output of the algorithm, while ( I14al l is satisfied with inequality for each i such that 1/ fi^ > l/fiam ™d (I14bl l 
is satisfied with equality. For these subcases, the sum-rate P'"(B,D) is bounded by P'"^(D") and there is no 
waste of power at the source nodes. 

For the case that jj^^^^ < maxj/i]*, /ij}, it holds that mm{fj(, /ij} < /i^^ < maxl/X]*, /i^} according to Lemma 1. 
Assume that fj,^ > ^\ and find A2 such that Pr2(A2) = P'°''(D'') - P2r(D2)- Let Ai = ^§ and define P^a = 
^ ^ (l/Ai — l/ai(fc))^. It can be seen from Lemma 3 that Pma > -Pma- The following subcases appear as 
pmax increases. If p^'-^^^ is small such that p™^^ < Pi, the algorithm proceeds through Steps 1-2-6 and 

Aj A° > max{^?,^^},Vi (21a) 
^Tr{P„(AO} = P;""'^ (21b) 

i 

at the output of the algorithm, while (I14al i and (I14bl i are satisfied with inequality. Increasing p™'^^ such that 
Pi < p^'-^^^ < Pijia, the algorithm proceeds through Steps 1-2-3-4-6 and 

Ai = /i" > A" (22a) 
^Tr{P„(A,)}=P,'""'^ (22b) 

i 

at the output of the algorithm, while (I14ab is satisfied with equality for i — 1 and inequality for i — 2. Note 
that there is waste of power at the source nodes for the above two subcases as long as p^^'^ < p^^ because the 
sum-rate P*^(B, D) is bounded by 4i(Ai) + ^r2(A2). 

Increasing p™^'^ such that P„ia < P™'''^ < P^., the algorithm proceeds through Steps 1-2-3-4-6-7. Further 
increasing p™'^^ such that Pt < P™^'^ < Pg, the algorithm proceeds through Steps 1-2-3-4-5-6-7. Further increasing 
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"r2(fe) 







r2(rr2) 



(a) P,-"'"' < Pma, MlLa > max{AtO,AtO} 



1 

-Vr .^ImaL 



rl(l) 



■rv-L\- 



r2(l) 



1-2 W 



(b) P,-"-- < P,„a, ^i'^, < max{At'i',At!j} 



■■r2(>>2) 



Fig. 1: Illustration of fil, /i2, /^?na' ^i^^ for '^he scenario of relay optimization. 



pmax gy^jj jjj^j pmax ^ jjjg algorithm proceeds through Steps 1-2-3-5-6-7. In the subcases when P™'''' > 
it holds that 



(23a) 
(23b) 



Ai = > A° 
^Tr{P„(AO}<Pr" 

at the output of the algorithm, while (I14al l is satisfied with equality for i = 1 and inequality for i = 2, and (I14bl) 
is satisfied with equality. The optimal A2 is found in Step 7 of the proposed algorithm. For these subcases, there 
is no waste of power at the source nodes. 

Two of the above subcases, one for the subcase p™^^ < Mma ^ max{/X]', IJ.2} and the other for the subcase 
pmax ^ p^^^^o^^ < max{^5;M2}' illustrated in Fig. [T] 

From the above discussion, it can be seen that the algorithm in Table U obtains the optimal power allocation in 
at most seven steps without iterations. 

Recall that the sum-rate of DF TWR is bounded by both the sum-rate of the MA phase and the sum-rate of the 
BC phase. In the scenario of relay optimization, the relay optimizes its power allocation which affects the sum-rate 
of the BC phase. Since the relay may or may not use all its available power at optimality (i.e., for the optimal power 
allocation), the sum-rate of the BC phase is not necessarily maximized at optimality. Moreover, it is also possible 
that the sum-rate of the BC phase at optimality is not even the maximum sum-rate of the BC phase that can be 
achieved using the power consumed by the relay at optimality. We specify the term efficient to describe such optimal 
power allocation of the relay that maximizes the BC phase sum-rate i?'^'^(B) with the actually consumed power at 
the relay. Thus, the relay's power allocation is efficient if it generates the maximum sum-rate for broadcasting the 
messages of the source nodes given its power consumption. For example, when the relay uses all its available power 
at optimality, the optimal power allocation of the relay is efficient if it maximizes the sum-rate of the BC phase, 
and inefficient otherwise. When the relay uses the power P^ < p^'^^ at optimality, the optimal power allocation is 
efficient if the achieved sum-rate of the BC phase is the maximum achievable sum-rate of the BC phase with power 
consumption P,-, and inefficient otherwise. Then the following two conclusions can be drawn for the scenario of 
relay optimization. 
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First, the optimal relay power allocation in the scenario of relay optimization is always efficient for the case that 
M?na — maxj/i^, /ij}. In such a case, it can be seen from ( II Sai l and (I20ab that 1/Ai = I/A2 at optimality regardless 
of whether the relay uses all its available power Therefore, the BC phase sum-rate R^'^{'B) is always maximized 
given the relay's power consumption in this case. However, the optimal relay power allocation is inefficient for the 
case that f/^^^ < maxj/ijj/ij} as long as p™^'^ > Pj. Moreover, the larger the difference between ma,x{ii1, fJ^} 
and /i^jjj in this case, the more inefficient the optimal relay power allocation becomes when p™^'^ > Pj. Given the 
definitions ( |13at -( fT3cl l and Lemma 1, the case with /i^j^^ < maxj/i^j/i^} indicates that one source node uses more 
power, has more antennas and/or better channel condition compared to those of the other source node. Indeed, if 
the power budget, number of antennas, and channel conditions are the same for the two source nodes, as an extreme 
example, it leads to /i^-^j^ > /i° = ^2- Therefore, it can be seen that the asymmetry between the power budget, 
number of antennas, and/or channel conditions can degrade the relay power allocation efficiency in the scenario of 
relay optimization. 

Second, the considered scenario of relay optimization may result in the waste of power at the source nodes. 
However, the relay never wastes any power This is due to the fact that the relay is aware of the source node power 
allocation strategies and optimizes its own power allocation based on them. As a result, it can use only part of the 
available power if its power limit p™^'' is large. However, the relay power allocation strategy is unknown to the 
source nodes when the source nodes decide their power allocation strategies. Therefore, the possibility of wasting 
power in the relay optimization scenario can be viewed as the tradeoff for low complexity. Indeed, in the scenario 
of relay optimization, there is no coordination between the relay and the source nodes. As a result, it is almost 
impossible to achieve the maximum sum-rate with minimum total power consumption referred to as network-level 
optimaUty. In order to achieve the network-level optimality, the scenario of network optimization, in which the relay 
and the source nodes jointly maximize the sum-rate of the TWR with minimum power consumption, is considered 
in Part II of this two-part paper 

IV. Simulations 

In this section, we provide simulation examples for some results presented earlier and demonstrate the proposed 
algorithm for relay optimization in Table H] The general setup is as follows. The elements of the channels Hii and 
Hii, Vi are generated from complex Gaussian distribution with zero mean and unit covariance. The noise variances 
cr|, Vi and <Tj"? are equal to each other and denoted uniformly as a^. While the source node power allocation strategy 
D*^ can be arbitrary, we use for simulations the that maximizes the MA phase sum-rate P™^(D). The rates 
P™^(D), Rij-CDi), and R^iCBi) are briefly denoted as P™^, Ri^ and R^, respectively, in the figures in this section. 

Example 1: A demonstration of Lemma 2. It is assumed that the number of antennas at the relay is 8 while 
source node 1 has ni ~ 6 antennas and source node 2 has n2 = 5 antennas. Each curve in Fig. |2] shows the sum-rate 
Rii + Ri-2 versus the water-level 1/Ai for a given ratio of p™''^ over cr^. In each curve, for each given 1/Ai, the 
relay consumes all the remaining power to maximize I/A2. Therefore, the power consumption of the relay is fixed 
and equals P™^^. For each curve, is different. The curve at the bottom corresponds to the ratio P^^'^/cr^ equal 
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Fig. 2; Rri + Rr2 versus 1/Ai under different P;"^'^/cr^ 
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(a) Hri in the optimal solution of the sum-rate maximizing problems (b) Relay power consumption, i?™''(D'') and J^-Rri in the optimal 
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with and without minimizing power consumption, respectively, versus solution of the sum-rate maximizing problems with and without 
P™'"' minimizing power consumption, respectively, versus P™^^ 

Fig. 3: Illustration of relay optimization. 



to 4 dB. For each time, when the ratio of p^^^ over increases, a new curve of + R^2 versus 1/Ai, which 
lies above the previous curve, is plotted. The curve at the top corresponds to the ratio P^^^/a^ equal to 7 dB. 
It can be seen from Fig. |2]that the sum-rate i?ri + ^12 is a nonconvex function of 1/Ai. However, i?ri + ^12 is 
non-decreasing in the interval from the minimum 1/Ai to the sum-rate maximizing 1/Ai and non-increasing from 
the sum-rate maximizing 1/Ai to the maximum 1/Ai. Note that 1/Ai = I/A2 = 1/A" when the BC phase sum-rate 
is maximized. As a result, it can be seen that increasing max{l/Ai, I/A2} and decreasing min{l/Ai, I/A2} while 
fixing the total power consumption leads to a smaller BC phase sum-rate for any given {1/Ai, I/A2}. Therefore, 
Fig. |2] verifies the result presented in Lemma 2. 

Example 2: The relay optimization problem. Fig. [3a| compares the BC phase rates at optimality of the relay 
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optimization problem, which considers power consumption minimization, with the BC phase rates at optimaHty of 
the problem ( fTTT l. which does not minimize the power consumption, under different P™^'^. One channel realization 
is shown. The specific setup for this simulation is as follows. The number of antennas ni,n2, and n^- are set to 
be 6, 5 and 8, respectively. The power limits for the source nodes are set to be = pmax = 3 w xhe noise 

variance is normalized so that cr^ = 1. The MA phase rates for this channel realization are 20.7 for i?™^(D°), 
11.2 for ^11.(0?), and 11.0 for ^2r(D§). In Fig. [Sa] R'^^ represents ^„(B-), where B^'s, Vi are the optimal 
solution (obtained using CVX ||23l ) to the problem (fTTT l which does not minimize the power consumption, and R^-i 
represents Ri-i(Bi), where B,;'s, Vi are the optimal solution to the relay optimization problem considering power 
consumption minimization obtained using the algorithm in Table U It can be seen from Fig. [3a] that R[.^ — R^i 
when p™''^ is small. The reason is that R'^^ is small when p™^^ is below certain threshold. As a result, the 
constraints in (fT2l l are always satisfied and the solutions to the problem (fTTT l and the relay optimization problem 
are the same. As p™^'^ increases, i?*™(B,D°) becomes larger and is finally bounded by i?'°'^(D°), while the 
relay power consumption is not necessarily minimized in the solution of the problem ( fTTT l which does not consider 
power consumption minimization. This can be seen from the first subplot of Fig. |3b] which shows that the power 
consumption in the solution derived using the proposed algorithm, denoted as P^, saturates when p™^^ > 4.9 W, 
while the power consumption in the solution to the problem (fTTT l which does not consider power consumption 
minimization, denoted as P}, keeps increasing. As a result, as can be seen from the second subplot of Fig. |3b] 
X; RT^ never exceeds i?"^'^(D*'), while Ki grows beyond i?'"^(D") when i?*^(B, D") is bounded by ^"^=^(0°). 

i i 

Meanwhile, it can also be seen from the second subplot of Fig. [3b]that the maximum sum-rates i?'*(B, D°) for the 
two compared solutions are the same, both of which equal to ^ R[^ = ^ R^i when ^ R[^ < i?™^(D'') and equal 

i i 

to R™''^{T)^) when '}2,R[i > R'°^^{T)'^). Thus, this example demonstrates that the proposed algorithm in Table H] 
achieves maximum sum-rate in the scenario of relay optimization with minimum power consumption. 

Example 3: The effect of asymmetry. The specific setup for this example is as follows. The noise variance is 
normalized so that cr^ = 1. The number of antennas at the relay, i.e., n^-, is set to be 6. The power limit of the relay, 
i.e., p^^^^ is set to be 3 W. The total number of antennas at both source nodes is fixed such that ni+n2 = 6. The 
total available power at both source nodes is also fixed such that pmax^pmax — 5 ^ Given the above total number 
of antennas and total available power at the source nodes, the relay optimization problem is solved for different m, 
n2, P™^'', and p^^^ for 1000 channel realizations. The resulting average sum-rate and average power consumption 
of the relay, and the percentage of efficient power allocation at optimality are plotted in Figs. Ha] |4b] and f4cT 
respectively, versus the difference between the number of antennas and the difference between the power limits 
at the source nodes. From Fig. |4al it can be seen that the sum-rate at optimaUty of the relay optimization is the 
largest when there is no asymmetry in the number of antennas at the source nodes and no asymmetry or only 
small asymmetry in the power limits of the source nodes. As the asymmetry becomes larger in either number of 
antennas or power limits, the sum-rate at optimality of the relay optimization decreases. Therefore, it can be seen 
from this figure that the asymmetry in the above aspects leads to smaller sum-rate at optimality of the considered 
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„j PS'" -PJ'" (W) 

(c) Percentage of efficient power allocation at optimality. 



Fig. 4: Effect of asymmetry: the average sum-rate, average relay power consumption, and percentage of efficient 
power allocation at optimality of relay optimization versus the difference between number of antennas and the 
difference between power limits at the source nodes in 1000 channel realizations. 

relay optimization problem. Relating Figs. |4b] and |4c] to Fig. |4al two more observations can be made. First, the 
relay does not necessarily use all the available power for sum-rate maximization in the relay optimization scenario. 
Second, the asymmetry in number of antennas and power limits leads to low power allocation efficiency. It can be 
seen from Fig. |4b] that when one of p™^'^ — pmax _ positive while the other is negative, the relay 

uses a part of its available power However, the achieved sum-rate is smaller compared to the sum-rate in the case 
when p™^^ — pmax _ g — = (sec Fig. Hill. In this situation, since the average power consumption 

and the average sum-rate are both low, the percentage of efficient power allocation is larger than but less than 
the percentage when p^^^ — pmax _ g — n2 =0, as can be seen from Fig. |4c] When p™^^ — pmax 

and ni — n2 are both positive or both negative, the relay uses more power than the power used in the case when 
pmax _ pmax _ g m — n2 — whilc the achieved sum-rate is smaller than that in the latter case. In this 
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situation, since the average power consumption is high while the average sum-rate is low, the percentage of efficient 
power allocation is very low, if not zero, as can be seen from Fig. |4c] The above facts become more obvious when 
the asymmetry becomes larger Therefore, it can be seen from Figs. |4b] and |4c] that the asymmetry on the power 
limits and the number of antennas can lead to low power allocation efficiency. 



In Part I of this two-part paper, we have solved the problem of sum-rate maximization with minimum power 
consumption for MIMO DF TWR in the scenario of relay optimization. For finding the optimal solution, we have 
proved the sufficient and necessary optimality condition for power allocation. Based on this condition, we have 
proposed an algorithm to find the optimal solution. The proposed algorithm allows the relay to obtain its optimal 
power allocation in several steps. We have shown that, as a trade-off for low complexity, there can be waste of power 
at the source nodes in the relay optimization scenario because of the lack of coordination. We have also shown that 
the asymmetry in the number of antennas and power limits at the source nodes can result in the degradation of the 
sum-rate performance and the power allocation efficiency in MIMO DF TWR. Next, in Part II of this two-part paper, 
we will investigate the scenario in which the relay and the source nodes jointly optimize their transmit strategies 
to achieve the network-level optimaUty of sum-rate maximization with minimum total power consumption for the 



A. Proof of Lemma 2 

Lemma 2 is proved in two steps, i.e.. Steps A and B. In Step A, we prove that ^ Rii{X[) can be increased by 



modifying the current power allocation on two specific subchannels. In Step B, we show that J^^i-iiK) ^'^y 



further increased. 

Step A: J2 ^riiK) ^^'^ tie increased. Given the fact that X]Tr{Pr;(A/)} X]Tr{Pr;(AJ)}, it can be shown 
that 1/A^ > min{l/ai(fc)} as long as 1/Aj > mm{l / a j{k)}. As a result, there exist fci and ^2 such that 

k k 

1/A^ > l/a,(fci) and l/A^ > l/aj(fc2). Define f{p„{ki)) = log {l + a^{kl)pr^{kl)) + log (l + (fc2)Prj (fc2)) 
where Prj{k2) = p—Pn{ki) and p is a positive constant. It can be seen that f{pri{ki)) is strictly concave in Pri{ki) £ 
[0,p],\/p > 0. Set p = (l/A^- — 1/0^(^2))^ + 1/A^ — 1/Qi(fci). The optimal allocation of the power p on ai{ki) and 
a,(fc2) that maximizes is p„(fci) = (1/A°p*(p) - l/a,(fci))+ and ^^^(^2) = (l/A°P*(p) - l/aj(fc2))^ 

where A°P*(p) is a function of p and l/A°P*(p) is the optimal water level. It can be shown that l/A°P*(p) < 1/A^. 
There exist two cases, i.e., l/A°P*(p) < 1/X, and l/A°P*(p) > 1/A,. In the case when l/A°P*(p) < 1/A,, it follows 
that (l/A°P*(p)-l/Q:,,(fci))^ < (l/Aj-l/Q!j(fci))+ < l/A^-l/aj(fci). The power allocation on fci and fc2 using 



V. Conclusion 



MIMO DF TWR. 



VI. Appendix 



A^ and A^ is 




(24a) 



(24b) 
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Since /(pri(fci)) is strictly concave as mentioned above, it can be seen that the power allocation 



1 1 ^ + 



Pr.(fci) = T TTT (25a) 



Prj{k2) = ( V TTT 

~^ K ai{ki) ai(fci)) ^^^^^ 

which reduces Priiki) and increases p-[j{k2), both by l/A'^ — 1/Q;i(fci) — (l/A^ — 1/Q:i(fci))^, yields higher /(pr,;(fci)) 
than the power allocation in ( |24] |. 

Therefore, the sum-rate ^ ^ log (l + ai{k)p-ci{k)') achieved using dZST l and 

Pr«(fc) = f ^ - — , Vfc e I, \ {fci} (26a) 

P.(^)=(^-^)^Vfcez,\{M (26b) 

is larger than X]^r;(^0- This is the first step of increasing sum-rate. Moreover, it can be seen that there exists A^ 

I 

such that 



Tr{P„(A:)} 



1 



A- ai{ki) J \\i ai{ki) ^ 
Tr{P,,(A,)}-5]Tr{PH(A;)} (27b) 



1 1 ^ + 



and the power allocation 



\K a,{k)j 

p„ (fc) =(^- , Vfc e I, (28c) 

VAj aj(fc)/ 

which spreads the power l/X'^ — l/ai(fci) — (l/Ai — l/ai(fci))^ over aj(fc)'s, Vfc e X,, achieves even higher 
sum-rate than that achieved by the power allocation specified by ( |25] l and ( |26] l. This is the second step of increasing 

the sum-rate. 

For the second case in which l/Xi < l/A°P*(p) < l/A'^, the following process is adopted. Similar to the two 

steps of increasing the sum-rate in the first case, the sum-rate X^X^l^S (l + cii{k)pii{k)) increases after each of 

/ fc 

the following two adjustments of power allocation. First, reduce Prj(fci) from 1/A^ — l/ai(fci) to (l/A°P*(p) — 
l/ai(fci))^. Then, spread the reduced power 1/A^ — l/ai(ki) — (l/A°P'(p) — 1/Q!i(fci))^ over aj(fc)'s , fc e X, 
by finding and using l/Aj which satisfies 

+ 



Tr{P„(AD}- 



7 I ^ I A°P'(p) Q.(fci) 



-Tr{P,,(A;.)}-ETr{Pr/(A;)}. (29) 
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After the adjustments, it is straightforward to see that the total power allocated on fci and fc2 is reduced from 
p= (l/A;.-l/a,(fc2))^ + l/A',-l/a,(A:i) top= (l/A^. - l/a,(fc2))^ + (1/A°p'(p) + . In consequence, 
there exists a new optimal water level l/A°P'(p) based on which the optimal allocation of the power p, i.e., Pr j(fci) ~ 
(l/A°P'(p) - and ^^(fcs) = 1/A°p'(p) - 1/0^(^:2), maximizes f{p„{ki)) when p in f{p„{ki)) is 

substituted by p. Since p < p, it can be seen that l/A°P*(p) < l/A°P'(p). Update p and l/A°P'(p) so that p = p 
and l/A°P'(p) = l/A°P'(p). Then the above process of reducing to (l/A°P*(p) - l/ai{ki))^, finding the 

new 1/X'j and the new l/A°P*(p) can be repeated until a). l/A°P*(p) < 1/A, or until b). l/A°P'(p) < l/ai{ki). 
The former matches the condition for the first case discussed in the previous paragraph and therefore can be dealt 
with in the same way as in the first case, which leads to ( |28] |. The latter implies that 1/Ai < l/A°P*(p) < 
in which case the power allocation can also be equivalently written as ( |28] |. Note that during this process the 
sum-rate J2J2^'^S + ai(^)-PrK^)) increases. Therefore, summarizing the above two cases of l/A°P*(p) < l/A; 

I k 

and l/A°P*(p) > 1/Ai, it is proved that the sum-rate can be increased by reducing Pri{ki) from l/A'^ — l/ai{ki) 
to (l/Ai — l/Q;i(A:i))^ and using the power allocation in (l28T l. 

Step B: J2^y:i{^i) may be further increased. Keep the above selected fc2 unchanged. As long as there exists k 
such that Pri{k) = (l/A'j — l/Q;i(A:i)) and Pii{k) > 0, this k can be selected as fci and the procedure of reducing 
Pri(fci) from — l/ai(ki) to (l/A^ — l/ai(/ci))^ and spreading the reduced power over aj{k)'s, Vfc G Tj as 
specified in ( |28] | can be performed. This process can be repeated until pnik) = (l/Ai — l/ai{k))^ ,\fk e {q G 
Xi\{l/X'^ - l/a,{q))^ > 0} and p,,{k) = 0,Vfc e {q e Zi\{l/X'^ - ^/aiiq))^ = 0}. Note that the sum-rate 
log (1 + ct.i{k)pii{k)) increases in the above process for every qualifying ki. The resulting power allocation 

I k 

on ai(fc)'s, Vfc G li is equivalent to Pri{k) = (l/A^ — l/aj;(fc)) ,Vfc G 2i since (l/Ai — l/ai(fc)) = if 
(l/A^ — 1/Q!i(fc))^ = 0. From the procedure described in the previous paragraphes, the resulting power allocation 

on Q!j(fc)'s, Vfc G Ij is Prj{k) = (l/Aj — l/aj(fc))^, Vfc. According to the power constraint X Ti"{Pri(A;)} = 

I 

^ Tr{Pi.;(AJ)} and the fact that the total power consumption is fixed at all time, it can be seen that 1/Aj — 1/Aj. 
Summarizing the above two steps. Lemma 2 is proved. ■ 

B. Proof of Lemma 3 

Given that A- < Aj, we have A^ < A- < Xj < X'y According to Lemma 2, there exists Xi < A- such that 

Tr{P,,(A9}+Tr{P,,(A,)} 

= Tr{P„(A,)} + Tr{Pr,(A;-)} (30) 

and 

Rr^{X'i) + RrjiXj) > 4.(A.) + 4j(A;). (31) 

Therefore, given that 

4.(AD + R,,{Xj) - R„{X,) + 4j(A;) (32) 
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it is necessary that > A^. As a result, it leads to 

Tr{P„(AO}+Tr{P,,(A,)} 

<Tr{P„(A,)}+Tr{P,,(A;.)}. (33) 
Lemma 3 is thereby proved. ■ 

C. Proof of Theorem 1 

First we prove that the optimal water-levels must satisfy condition (|17a| i. It can be seen that the maximum 
i?*'^(B, D) is achieved with minimum power consumption using Ai = A2 = maxjA*^, /i^jal when min{l/ /i?, 1/^*2} ^ 
l//^ma at the optimality. Therefore, it is necessary that min{l//x", l//i2} < 1/Mma given that Ai 7^ A2 at optimality. 
Let us consider the case when min{l/ Ai, I/A2} = 1/Ai < I/A2 at optimality. According to the constraint (ll4aK we 
have that 1/Ai < 1/^2 optimality. Similarly, it can be seen that I/A2 < I/m? at optimality. Since 1/Ai < I/A2, 
it leads to the result that 1/Ai < 1/^2 < at optimality. Assuming that min{l//i°, l/zi^} 7^ l/^^i at optimality 
when Ai ^ A2, it infers that 1/Ai < l/ji^ < I/A2. However, it can be seen that the power allocation using 
1/Ai < l/fJ,2 < 1/^2 does not provide the maximum achievable i?''^(B, D) according to Lemma 2. Consequently, 
the resulting power allocation is not optimal. It contradicts the assumption that mm{l / nl, 1 / 1I2} 7^ 1/Ai at 
optimality. Thus, the above assumption is invalid and it is necessary that min{l//i°, 1/^2} = at optimality 
when Ai A2. Similarly, it can be proved that mm{l/ nl,!/ ^12} — at optimality when Ai A2 for the 

case when min{l/Ai, I/A2} = I/A2 < 1/Ai. Therefore, it always holds true that min{j^, jj} = ^i'^l^i ^f'''} 
Ai ^ A2. 

Next we prove that the optimal water-levels must satisfy condition ( I17bl i. It is straightforward to see that 
1/Ai = I/A2 < l/A*^. Moreover, according to the constraints (I14at and (I14bt . it is not difficult to see that 
1/Ai = I/A2 < min{l/^'j', 1/^°!^} when 1/Ai = I/A2 at optimality. Indeed, if 1/Ai = I/A2 > l/fi^,^, then 
( I14bl i cannot be satisfied. If 1/Ai — I/X2 > min{l//i]', I/1J2}, then ( I14al i cannot be satisfied. Combining the above 
two facts, we have 1/Ai = I/A2 < min{l/iJ,i, l//i2i l/z^ma; 1/^°} when 1/Ai = I/A2 at optimality. For the case 
that min{l/ /ij. 1 /1J.2} > the above constraint can be written as 1/Ai = I/A2 < min{l/ /i^jai ^/^^}- For this 

case, it is straightforward to see that the achieved sum-rate is not maximized if 1/Ai = I/A2 < min{l//i°jjj, 1/A"}. 
Therefore, the optimal water-levels must satisfy condition (I17bb when min{l//i5, 1/^2} ^ l//^ma given that 
1/Ai = I/A2. For the case when min{l//^", l/zi"} < it can be seen that 1/A° < min{l//i", l/zi^} 

given that 1/Ai = I/A2 at optimality. Otherwise, it can be shown that either of the following two results 
must occur. If 1/A*' > minll/zi]*, l//i2} and 1/Ai = I/A2 < min{l//i]', l/zi"}. then the sum-rate can be 
increased. If 1/A" > min{l//x", l/zi^} and 1/Ai = I/A2 > min{l//x", l/zi^}, then the constraint ( I14ab cannot 
be satisfied. Therefore, given that 1/A" < mm{l/ fil,!/ PI2} for '^he case when mm{l/ jJl,!/ ^12} < l/^ma 
and 1/Ai = I/A2 at optimality, we have l/A*' < min{l//i", 1/^*2} < l/A'ma- Consequently, the constraint 
1/Ai = I/A2 < min{l^;, can be rewritten as 1/Ai = I/A2 < 1/AO = min{l/^o„^, 1/A"}. It 

is straightforward to see for this case that 1/Ai = I/A2 < 1/A" does not maximize the sum-rate. Therefore, it can 
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also be concluded that 1/Ai = I/A2 = 1/A" = mm{l / fJ^-^^, 1/A"} when niin{l//i]', I/M2} < l/^ma- Combining 
the above two cases of min{l//i5, l//^2} — l/^ma minjl//^", l/zi^} < l//^?na' i'^ '-^'^ ^^^i^ '^^at the optimal 
water-levels always satisfy condition illhi given that 1/Ai = I/A2. 

The above two parts complete the proof of Theorem 1 . ■ 

D. Proof of Theorem 2 

The necessity of the constraints ( |14at and ( |14b| l is straightforward. It can be seen that the power consumption 
can be reduced without reducing the sum-rate i?'^(B, D) when these constraints are not satisfied. The necessity of 
the constraints dlTab and ( I17bl i is proved in Theorem 1 in Section IVI-CI Therefore, we next prove the sufficiency 
of the constraints (I14al i. ( I14bb . (I17al i, and (I17bl l. 

We use proof by contradiction. Assume that the above constrains are not sufficient to determine the optimal 
{Ai,A2} with minimum power consumption among all {Ai, A2}'s that maximize the sum-rate i?'"(B,D). Then 
there exists {AJ, Aj} satisfying (fT4l l and ( |17a| i-( fT7b] i that maximizes the sum-rate and does not minimize the power 
consumption. Consequently, at least one of 1/ A| and '^^^ reduced without reducing i?*^ (B, D). We consider 
the following two cases. The first case is when \\ ^ \\ while the second case is when \\ = \\. In the first case, 
{A|, A2} satisfies (I17ab and it is straightforward to see that reducing min{l/A|, l/Aj} is not optimal according to 
Lemma 3. Reducing max{l/A|, I/A2}, on the other hand, necessarily leads to the decrease of i?'"(B,D) given 
that ( I14bb is satisfied. Therefore, reducing either of 1/ A| and results in the decrease of the sum-rate, which 
contradicts the previous assumption. In the second case, {AJjAj} satisfies ( I17bb . According to Theorem 2, it is 
necessary that = = min{l//i"-jj^, 1/A"}. From Lemma 2, it can be seen that it is not optimal to reduce 
only one of 1/A| and 1 /Aj. Reducing both of 1/A| and 1 / aJ, on the other hand, necessarily leads to the decrease of 
i?'^(B, D) given that ( I14bb is satisfied. Therefore, it is impossible that there exists {A|, A2} with A| = A2, satisfying 
(fl4] l and (I17bb . that maximizes the sum-rate while the resulting power consumption can be reduced. Combining the 
above two cases, it can be seen that the power consumption cannot be reduced given that the {A|, Aj} maximizes 
the sum-rate subject to the relay power limit and satisfies (O and ( I17ab -( ll7bb . This contradicts the assumption 
that the above constrains are not sufficient to determine the optimal {Ai, A2} with minimum power consumption 
among all {Ai, A2}'s that maximize i?'^(B,D). This completes the proof for Theorem 2. ■ 

E. Proof of Theorem 3 

The optimality of the pair {Ai, A2} obtained using the algorithm in Table|T]is proved in three steps: A) Steps 2-5 
of the algorithm in Table |I] find {Ai, A2} that maximizes i?'^'^(B, D*') with minimum power consumption subject 
to the constraint in (fTTb and the constraint (I14ab . B) The pair {Ai, A2} obtained from Steps 2-5 of the algorithm in 
Table H] needs to be modified to maximize the objective function in (fTTT i with minimum power consumption. Step 6 
of the algorithm in Table |T] deals with two cases in which {Ai, A2} obtained from the previous steps can be simply 
modified to obtain the optimal pair {Ai, A2}. C) Step 7 of the algorithm in Table U deals with the remaining case 
which is more complicated and finds the corresponding optimal pair {Ai, A2} in this case. It is not difficult to see 



January 10, 2013 



DRAFT 



22 



that the constraint in (fTTT i is always satisfied in any step of the proposed algorithm. It can also be seen that Steps 1, 
2 and 6 ensure that ( I17bb is satisfied if Ai = A2 at the output of the algorithm while Steps 3 to 5 ensure that (I17ab 
is satisfied if Ai ^ A2 at the output. Therefore, in the following we only consider the constraints (I14ab and (I14bl) . 
which are equivalent to the constraints in (fTSI i. 

A. Steps 2-5 find the pair {Ai, A2} that maximizes i?(B,D") with minimum power consumption subject to the 
constraint ( I14ab . Note that the maximum i?(B,D'') with minimum power consumption is achieved by i?ii(Ai) + 
^r2('^2) for some specific {Ai,A2} if (|14ab is satisfied. Therefore, it is equivalent to finding the {Ai,A2} that 
maximizes _Ri.i(Ai) + i?r2(^2) subject to (|14ab . The initial power allocation in Step 1 of the algorithm in Table U 
using 1/Ai = I/A2 = 1/A" maximizes i?i.i(Ai) + i?r2('^2)- Regarding the constraint (I14ab . the following cases are 
possible. 

A-1. Ai > /i^, Vi. In this case, the constraint ( I14ab is satisfied and {A°,A"} is the desired {Ai,A2}. 

A-2. Ai < /i" and Aj > /i^. In this case, the constraint (I14ab is not satisfied for i. The relay power consumption 
can be reduced without decreasing -R(B, D°) by increasing Ai until Ai = Then, -R(B, D") can be increased by 
decreasing Aj until the relay power limit is reached or until \j = 

A-3. Ai < n^jj^i- In this case, it is straightforward to see that the pair {Ai, A2} that maximizes i?(B,D*') with 
minimum power consumption subject to the constraint (I14ab satisfies Ai — 

The above three cases are determined in Step 2. Case A-1 is dealt with in Step 2 of the algorithm in Table H 
Case A-2 is dealt with in Steps 3 and 4. Case A-3 is dealt with in Steps 3 and 5. 

B. Steps 6 and 7 of the algorithm in TableUfind the optimal pair {Ai, A2} that maximizes the objective function 
in (fTTb with minimum power consumption. Since i?'°''(D°) < Rij-CDl) + /i'2r(D2), it can be seen that Ai,Vi 
should either increase or remain the same in order to satisfy the constraint ( |14bb given that the constraint ( |14ab is 
satisfied. Therefore, the optimal power allocation can be derived by increasing Ai and/or A2, if necessary, based on 
the power allocation derived from Steps 1-5. Regarding the constraint (I14bb . the following cases are possible. 

B-1. Aj > Mma.^* or (^^i > Mma' ^3 < A^ma ^nd -Rri(Ai) + Rr2{>^2) < R'"^"" (D'^)^ . In this case, the constraint 
(|14bb is satisfied and the current {Ai,A2} is optimal. 

B-2. Ai < A^ma, Vi and ^ri(Ai) + ^12(^2) > i?™''(D"). In this case, it is not difficult to see that it is optimal to 
simply set Ai = Mma^^^- 

B-3. A, > fil^^, Xj < AiO„a and 4i(Ai) + 4-2(A2) > i?'"^(D"). 

Cases B-1 and B-2 are simple and dealt with in Step 6 of the algorithm in Table J] It can be shown that in these 
two cases the constraints ( I14ab and ( I14bb are both necessary and sufficient for finding the optimal power allocation 
in terms of maximizing the sum-rate with minimum power consumption. Case B-3 is dealt with in Step 7. The 
optimal strategy in Case B-3, as in Step 7 of the algorithm in Table IH is to increase Aj while keeping Ai unchanged 
until _Rii(Ai) + -Ri2(A2) = i?'"*(D"). In order to prove that this strategy is optimal, the following three points are 
necessary and sufficient. 

1. It is optimal to increase min{Ai}. 

i 

2. X, = fi"^ if A, > ^o„^ and Aj < fi°^^. 
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3. At optimality, the increased Xj, denoted as A^, satisfies Xj < X'j < /i"ia- 

The first point states that it is optimal to increase Xj as long as Xj < A^. The second point infers that it is not 
optimal to decrease A^. The third point infers that A^ is always larger than Xi and therefore it is not optimal to 
increase A^ at any time. The first point follows from Lemma 3. For the second point, assume that Xi > fJj. It follows 

that p^^^^^ is used up, i.e., p^^^^ = S (V^; ^ l/o^K^'))^- Otherwise, the equality in the constraint (I14al i is not 

; k 

achieved for i and the objective function in ( fTTT ) can be increased by decreasing Xi, which contradicts Steps 1-5 

of the algorithm in Table |T] Given that A; > ^" and P^ax = (l/A; - l/ai{k))^, it can be proved that 

; k 

> Otherwise, the power allocation can be proved not optimal based on Lemma 2 because the objective 

function in ( fTTT i is not maximized subject to the constraint (I14al ). which contradicts Steps 1-5 of the algorithm in 
Table HI However, the conclusion that l/A; > 1/Xj contradicts Case B-3 in which Xi > /ij^^: Aj < Mma- Thus, the 
assumption that Xi > f/j is invalid. Since Xi > ^/j at the output of Steps 1-5 of the algorithm in Table H] we have 
Aj = For the third point, assume that A^- > fi'^^. Then it follows that -Rri(Ai) + -Rr2(A2) < i?™''(D°) , which 
is not optimal. Therefore, A^ < f/^^^ at optimality of Case B-3. 

C. Finally, we prove that A^ found in Step 7 of the algorithm in Table J] for Case B-3 is optimal. The optimal 
X'j for Case B-3 is the solution to the following optimization problem 

1 



(34a) 

s.t. 4,(A0 + 4,(A;) = i?'"^(DO). (34b) 
Using the definition that Pi-i{k) = (l/A,; — l/ai{k))^ and ~ {k\pj-i{k) > 0}, the constraint in (l34l l is equal to 

Rr^iX^)+ log ^ = i?'"^ (0°) . (35) 

keM+ ^ 

As previously proved, A^ — in Case B-3, which means that i?ri(Ai) — Rj^CD^). Thus, the above equation can 
be written as 



Y log^=i?-'^(D°)-4,(D°). (36) 

keM+ ^ 

Therefore, the optimal A^ satisfies 

\M+\\ogX'j^ E log«,(fc)-ff""(D") + 4..(D«) (37) 

keM+ 

and the optimality of the water level A^ found in Step 7 of the algorithm in Table |l] is proved. 

The proof of Theorem 3 is thereby complete. ■ 
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